Performance Confidence Estimation for Automatic Summarization

نویسندگان

Annie Louis

Ani Nenkova

چکیده

We address the task of automatically predicting if summarization system performance will be good or bad based on features derived directly from either singleor multi-document inputs. Our labelled corpus for the task is composed of data from large scale evaluations completed over the span of several years. The variation of data between years allows for a comprehensive analysis of the robustness of features, but poses a challenge for building a combined corpus which can be used for training and testing. Still, we find that the problem can be mitigated by appropriately normalizing for differences within each year. We examine different formulations of the classification task which considerably influence performance. The best results are 84% prediction accuracy for singleand 74% for multi-document summarization. Disciplines Computer Sciences Comments Louis, A. & Nenkova, A., Performance Confidence Estimation for Automatic Summarization, 12th Conference of the European Chapter of the Association for the Computational Linguistics, March-April 2009, doi: anthology/ E09-1062 This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/724

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

    Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...

متن کامل

Systematic literature review of fuzzy logic based text summarization

Information Overloadrq is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

متن کامل

Large-Margin Determinantal Point Processes

 Investigate determinantal point processes (DPPs) for discriminative subset selection  Proposemargin based parameter estimation to explicitly track errors in selecting subsets  Balance different types of evaluation metrics, e.g., precision and recall  Improve modeling flexibility by multiple-kernel based parameterization  Attain state-of-the-art performance on the tasks of video and docume...

متن کامل

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation

In this paper, we propose a new method of automatic speech summarization for each utterance, where a set of words that maximizes a summarization score is extracted from automatic speech transcriptions. The summarization score indicates the appropriateness of summarized sentences. This extraction is achieved by using a dynamic programming technique according to a target summarization ratio. This...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Performance Confidence Estimation for Automatic Summarization

نویسندگان

چکیده

منابع مشابه

A survey on Automatic Text Summarization

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

Systematic literature review of fuzzy logic based text summarization

Large-Margin Determinantal Point Processes

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation

عنوان ژورنال:

اشتراک گذاری